A Comparative Study on the Application of Hierarchical-Agglomerative Clustering Approaches to Organize Outputs of Reiterated Docking Runs

Authors

  • Giovanni Bottegoni
  • Andrea Cavalli
  • Maurizio Recanatini
Abstract

Reiterated runs of standard docking protocols usually provide a collection of possible binding modes rather than pinpointing a single solution. This ensemble is typically ranked by means of an energy-based scoring function. However, since many approximations have to be introduced in the computation of the binding free energy, scoring functions cannot always rank the experimental pose among the top scorers. Cluster analysis might help to overcome this limitation, provided that the clusterability of the data has been assessed beforehand. In this paper, we first present a modified version of a test originally developed by Hopkins to assess whether docking outputs show a natural tendency to group into clusters. We then report the results of a comparative study on the application of different hierarchical-agglomerative clustering rules to partition docking outputs. The rule that best handled the observed data was finally applied to the whole ensemble of poses collected from several docking tools. The combination of the average linkage rule with the cutting function developed by Sutcliffe and co-workers turned out to meet all of the criteria required for a robust clustering protocol. Furthermore, consensus clustering allowed us to identify the pose closest to the experimental one within a statistically significant cluster; the number of such clusters was always small (a few units).
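As an illustration of the average linkage rule mentioned in the abstract, the following is a minimal sketch in plain Python. It assumes the docking poses have already been compared pairwise (for example by RMSD) and are supplied as a symmetric distance matrix. The function name `average_linkage`, the toy `rmsd` matrix, and the fixed `cutoff` are hypothetical; in particular, the fixed cutoff is only a stand-in for the cutting function of Sutcliffe and co-workers, which is not reproduced here.

```python
from itertools import combinations


def average_linkage(dist, cutoff):
    """Agglomerative clustering with the average linkage rule.

    dist   : symmetric matrix (list of lists) of pairwise pose distances,
             e.g. RMSD values (assumed to be precomputed).
    cutoff : hypothetical fixed threshold; merging stops once the closest
             pair of clusters is farther apart than this value.
    """
    # Start with every pose in its own cluster.
    clusters = [[i] for i in range(len(dist))]

    def linkage(a, b):
        # Average linkage: mean of all member-to-member distances.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > 1:
        # Locate the pair of clusters with the smallest average distance.
        (x, y), best = min(
            (((p, q), linkage(clusters[p], clusters[q]))
             for p, q in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if best > cutoff:
            break
        # Merge cluster y into cluster x (x < y, so deleting y is safe).
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]


# Toy distance matrix for five poses: two tight groups far from each other.
rmsd = [
    [0.0, 0.5, 0.7, 6.0, 6.2],
    [0.5, 0.0, 0.6, 5.9, 6.1],
    [0.7, 0.6, 0.0, 6.3, 6.0],
    [6.0, 5.9, 6.3, 0.0, 0.4],
    [6.2, 6.1, 6.0, 0.4, 0.0],
]
print(average_linkage(rmsd, cutoff=2.0))
```

In this sketch the result depends entirely on the chosen cutoff, which is exactly the decision that a cutting function is meant to automate. For larger pose sets, an established library routine would be a more practical choice than this quadratic-per-step loop.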

Similar resources

AClAP, Autonomous hierarchical agglomerative Cluster Analysis based protocol to partition conformational datasets

MOTIVATION: Sampling the conformational space is a fundamental step for both ligand- and structure-based drug design. However, the rational organization of different molecular conformations still remains a challenge. In fact, for drug design applications, the sampling process provides a redundant conformation set whose thorough analysis can be intensive, or even prohibitive. We propose a statist...

A Comparative Agglomerative Hierarchical Clustering Method to Cluster Implemented Course

There are many clustering methods, one of which is hierarchical clustering. Most of the approaches to the clustering of variables encountered in the literature are of the hierarchical type, and the great majority of these are agglomerative in nature. The agglomerative hierarchical approach to clustering starts with each observation as its own cluster and ...

Application of Clustering Methods in DNA Microarrays

Background: DNA microarray technology has paved the way for investigators to assess the expression of thousands of genes in a short time. Analysis of this large amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering techniques in DNA microarray analysis. Materials and methods: We analyzed data from the study by Van’t Veer et al. dealing with BRCA1...

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and measuring their similarity is a core part of the MTS analysis process. Many research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques available to estimate similarity between MTS, this field suffers from a lack of comparative...

Implementation of Hybrid Clustering Algorithm with Enhanced K-Means and Hierarchal Clustering

We propose a hybrid clustering method that combines the strengths of both partitioning and agglomerative clustering methods. Clustering algorithms that build meaningful hierarchies out of large document collections are ideal tools for their interactive visualization and exploration, as they provide data-views that are consistent, predictable, and at different levels of granularit...

Journal title:
  • Journal of Chemical Information and Modeling

Volume 46, Issue 2

Pages: -

Publication date: 2006